-
-
Notifications
You must be signed in to change notification settings - Fork 2.8k
Implement uring in AsyncWrite for fs::File
#7713
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
tokio/src/fs/file.rs
Outdated
| } | ||
|
|
||
| cfg_io_uring! { | ||
| struct BoxedOp<T>(Pin<Box<dyn Future<Output = T> + Send + Sync + 'static>>); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Does the Future need to be Sync ?
I think it is not really necessary.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks like it does need Sync, at least its required by a test https://github.com/RedKinda/tokio/blob/uring_file/tokio/tests/async_send_sync.rs#L651 - https://github.com/tokio-rs/tokio/actions/runs/18909744941/job/53976898682?pr=7713#step:6:248
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In principle you could avoid Sync using src/util/sync_wrapper.rs. But for this use-case, it does not matter.
tokio/src/fs/file.rs
Outdated
| let handle = BoxedOp(Box::pin(async move { | ||
| let (r, buf, _fd) = op.await; | ||
| match r { | ||
| Ok(_n) => (Operation::Write(Ok(())), buf), |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think there is a chance that the buffer is not fully drained here.
Write(Ok()) should be returned only if buf.is_empty().
I think you need to wrap the write in a loop and handle it with something like:
Ok(0) => break (Operation::Write(Err(io::ErrorKind::WriteZero.into())), buf),
Ok(_) if buf.is_empty() => break (Operation::Write(Ok(())), buf),
Ok(_) => continue, // more to write
Err(e) => break (Operation::Write(Err(e)), buf),There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good catch, thanks and fixed
tokio/src/fs/file.rs
Outdated
| let handle = BoxedOp(Box::pin(async move { | ||
| let (r, buf, _fd) = op.await; | ||
| match r { | ||
| Ok(_n) => (Operation::Write(Ok(())), buf), |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Same as above - make sure the buf is empty before saying Ok().
|
This should be ready for merging i think, is there anything else blocking it? |
|
Sorry I've been too focused on #7696. I will try to review this PR too. |
tokio/src/fs/file.rs
Outdated
| // Handle not present in some tests? | ||
| if let Ok(handle) = Handle::try_current() { | ||
| if handle.inner.driver().io().check_and_init()? { | ||
| task_join_handle = { | ||
| use crate::{io::uring::utils::ArcFd, runtime::driver::op::Op}; | ||
|
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is way too many levels of indentation. We need to pull some logic into a separate function.
tokio/src/fs/file.rs
Outdated
| #[derive(Debug)] | ||
| enum JoinHandleInner<T> { | ||
| Blocking(JoinHandle<T>), | ||
| #[cfg(all( | ||
| tokio_unstable, | ||
| feature = "io-uring", | ||
| feature = "rt", | ||
| feature = "fs", | ||
| target_os = "linux" | ||
| ))] | ||
| Async(BoxedOp<T>), |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Here is an idea:
What if you perform the io-uring logic in a tokio::spawn task? That way, you can use JoinHandle in both cases.
I think it will simply code a lot.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The Blocking(JoinHandle) is crate::blocking::JoinHandle not tokio::runtime::task::join::JoinHandle so some enum would be required anyway, and at that point might as well just have this kind of a boxed future
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That is a re-export of the same thing.
Line 9 in 4714ca1
| pub(crate) use crate::task::JoinHandle; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
One reason that I suggest this is that the io-uring logic performs multiple writes in a loop to write the entire buffer, which requires the end-user to await flush or write on the file to make progress. However, the implementation we have today does not require the user to interact with the file to make progress - it happens in the background automatically.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ah got it, I'll change it.
Out of curiosity, general rule is that for a future to keep making progress, user needs to keep polling it. I feel like not having to do that specifically for fs::File is just an implementation detail, and users shouldn't rely on that always being the case. So i guess my question is how much is it worth preserving old behavior like this when implementing new features, if it comes at the cost of (small) performance overhead? relevant XKCD
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Had to adjust mocks to accommodate non-blocking spawning https://github.com/tokio-rs/tokio/actions/runs/19464918802/job/55697554687 b91836c - can you doublecheck im doing it right?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This kind of thing is always a hard question. Though I would say that I'm pretty sure people rely on writes continuing even if the file is dropped, and it would be very hard to change that.
tokio/src/io/uring/write.rs
Outdated
| } | ||
|
|
||
| impl Op<Write> { | ||
| /// Issue a write that starts at `buf_offset` within `buf` and writes some bytes |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This docstring seems to be outdated. buf_offset is no more a parameter
tokio/src/fs/write.rs
Outdated
| let driver_handle = handle.inner.driver().io(); | ||
| if driver_handle.check_and_init()? { | ||
| return write_uring(path, contents).await; | ||
| use crate::io::blocking; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is this import needed here ?
It is already imported at line 8
tokio/src/fs/file.rs
Outdated
| buf, | ||
| ); | ||
| } | ||
| Ok(_) if buf.is_empty() => { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this arm should be moved to be first, i.e. before the Ok(0) one.
Motivation
Currently IO operations for
fs::Fileis implemented usingspawn_blockingto perform IO, which is not great for an async runtime. This PR implements IO operations usingio_uring. Thefs::writefunction already usesio_uringfor async operations, and this PR extends the uring functionality intoimpl AsyncWrite for fs::File.For full discussion see here #7684 and subsequently on discord https://discord.com/channels/500028886025895936/810724255046172692/1427735341003051079
Solution
The
fs::Filestruct already implements an internal buffer when writing usingspawn_blocking, which is similar to whatio_uringrequires. Therefore inpoll_write, we can check ifio_uringis initialized and supported, and if not fall back to thespawn_blockingimplementation, without requiring any API changes.This PR only implements
AsyncWrite, and once this PR is accepted, I am happy to also open a PR forAsyncRead for Fileusing the same structure.